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A convolution model which accounts for neural activity dynamics in the primary visual cortex is 
derived and used to detect visually salient contours in images. Image inputs to the model are mod- 
ulated by long-range horizontal connections, allowing contextual effects in the image to determine 
visual saliency, i.e. line segments arranged in a closed contour elicit a larger neural response than 
line segments forming background clutter. The model is tested on 3 types of contour, including a 
line, a circular closed contour, and a non-circular closed contour. Using a modified association field 
to describe horizontal connections the model is found to perform well for different parameter values. 
For each type of contour a different facilitation mechanism is found. Operating as a feed-forward 
network, the model assigns saliency by increasing the neural activity of line segments facilitated by 
the horizontal connections. Alternatively, operating as a feedback network, the model can achieve 
further improvement over several iterations through cooperative interactions. This model has no 
dynamical stability issues, and is suitable for use in biologically-inspired neural networks. 

PACS numbers: 42.66.Si, 87.18.Sn, 87.85.dq, 07.05.Mh 



I. INTRODUCTION 

Salient features in an image have the defining property 
that they instantly attract our visual attention towards 
their locations without the need to scan the entire im- 
age systematically— they are image features which have 
a "pop out" effect 0]. In addition, such features are in 
some sense rare or unique — they have a low probability 
of being mistaken for other features in an image, and of- 
ten best distinguish one particular object category from 
another [214^. This property may allow the visual sys- 
tem to rapidly segment and classify an image based on 
its most distinctive features, or to devote further visual 
processing resources to specific points in an image. In 
evolutionary terms a rapid response to salient features 
could be critical for biological systems in predator-prey 
situations. In computer vision, saliency has been shown 
to improve performance in object and feature recognition 
tasks 0,13. 

Saliency is thought to be assigned early in the visual 
pathway, and models of the primary visual cortex (VI) 
are often used for this purpose [5|-Q. One reason is that 
saliency is assumed to involve only bottom-up (image- 
based) mechanisms which group simple features together 
and process a scene rapidly and in parallel over the entire 
visual field [H. Later stages of visual attention probably 
involve top-down (task-dependent) cues [10]. Long range 
horizontal connections in VI provide a likely mechanism 
for rapid integration of information across the visual field 
pj]4l3]. The salient objects we shall mostly be concerned 
with in this work are closed contours — such as the bound- 
aries of objects and surfaces in images. Simple cells in 
VI respond best to oriented bars and edges like those 
detected using a gabor filter [14], and a contour is repre- 
sented as a discrete set of line segments in VI. However, 
any given image will result in many line segments in VI, 



and most of these will be unrelated to any salient con- 
tour. Long-range horizontal connections modulate neural 
activity, and provide a mechanism for grouping together 
line segments that form salient contours. For example, 
two tangents are said to be co-circular when they are 
tangent to the same circle [l5[ . If horizontal connections 
facilitate co-circularity then neural activity will be higher 
(and the feature more salient) when line segments form 
a circular contour rather than a cluttered background of 
line segments with random orientations and positions, for 
example. 

Experiments on contour detection by the human vi- 
sual system have shown that contour saliency depends 
on factors such as contour length, curvature, degree of 
fragmentation, whether the contour is open or closed, 
and the statistical propert ies of the background relative 
to the contour p!2l, [l6l,Tl7t . Line segments which are co- 
linear (i.e. lie on a straight line and are orientated in the 
same direction as the line) are highly salient, but saliency 
decreases as the line segments are either separated or 
their relative orientations are changed The saliency 
of a contour can be increased either by increasing the 
contour length or by closing the contour, but decreases 
with increase in the contour curvature [l6] . A final point 
is that saliency also depends on the statistical properties 
of the background — false contours formed by fortuitously 
aligned background Hue segments will decrease contour 
saliency [l^- Field et. al. proposed an association field 
for grouping line segments that form contours [18[. We 
will make use of this in Sec. Illll when choosing a form for 
horizontal connections. 

The purpose of this work is to present a neural model 
with VI architecture that can be used to detect visu- 
ally salient contours in images. We would also like to 
determine what level of performance is possible in such 
tasks using a simple VI architecture. This architecture 
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includes functionally specific long-range horizontal con- 
nections, local feedback connections, thresholding, and 
neural orientation preference. A key motivation is to see 
how a biophysically-constrained model of VI can be used 
to assign saliency in the 10-20 ms period available under 
certain viewing conditions [l9|. We would also like to 
provide a solid framework for including horizontal con- 
nections into certain classes of biologically-inspired hier- 
archical neural networks used for object recognition (2Qj . 
We start with a model of neural activity (spike-rate) dy- 
namics such as used in Refs. 0] and (2ll-[23j for describ- 
ing populations of VI neurons. From these dynamical 
equations a convolution model is derived under the as- 
sumption that horizontal connections only modulate neu- 
ral activity rather than drive it. A convolution model 
has the advantage that it avoids the stability problems 
found in dynamical neural models 0, Q • Operating as 
a feed-forward network, the convolution model increases 
the activity of line segments facilitated by the horizontal 
connections. Alternatively, operating as a feedback net- 
work, it increases the activity of facilitated line segments 
over several iterations through cooperative interactions. 

Previous models of salient contour detection can be 
divided approximately into two classes: Vl-like neural 
models with horizontal connections [5-7], and non- neural 
models as typified by Refs. |2j-l29j . Although the non- 
neural models are interesting in their own right, here our 
primary interest is in biologically-inspired visual process- 
ing, and we will not further consider non-neural models. 
In Ref. 1, neural elements were represented by gabor 
patches, and a dynamical model with facilitatory hor- 
izontal connections and global inhibition was assumed. 
Horizontal connections involved fitting a cubic spline be- 
tween pairs of neighboring patches, then including penal- 
ties for distance, curvature, and change in curvature — 
such that co-linear and co-circular neighbors were facili- 
tated. A region was salient when its steady state response 
reached a certain out-lier criterion level separating back- 
ground noise from signal. In Ref. [6|, neural elements 
were grouped using a model with facilitatory horizontal 
connections and various forms of competitive inhibition. 
Horizontal connections were formed from an association 
field similar to that proposed in Ref. [l8] , and co-circular 
neighbors were facilitated. Global inhibition was used 
to suppress weakly facilitated elements, and the remain- 
ing elements were modeled as phase oscillators. Salient 
regions exhibited phase synchrony due to strong facilita- 
tory interactions. In Ref. [7], each edge segment repre- 
sented a pair of neural elements, one excitatory and one 
inhibitory, which were described using a dynamical model 
with horizontal connections. Facilitation came from an 
association field, and inhibition was given by so-called 
flank inhibition [3, [9[ . Saliency was related to the activity 
level at the steady state. In addition, the nonlinear dy- 
namical properties of this model caused gaps in contours 
to be filled in, and activity to leak out across boundaries 

The structure of this paper is as follows: In Sec. [Ill 



the basic model is derived and explained. In Sec. Illllthe 
model is used to find salient contours in example images 
which include a line, a circular closed contour, and a 
non-circular closed contour. In Sec. |TVl a summary and 
a discussion are given. 



II. MODEL 

The model we consider describes the processing of vi- 
sual information in a sequence of stages which reside in 
early parts of the visual pathway. Visual input from an 
image arrives at VI simple cells, which respond most 
strongly to visual features such as a bar or edge of a 
particular orientation. Dynamical processes then rapidly 
redistribute the neural input among the simple cells via 
functionally-specific long range horizontal connections. 
Once neural activity reaches equilibrium after approxi- 
mately 10 ms it is either directly output to later stages of 
the visual pathway, or else iterated several times through 
thresholding and local feedback connections before being 
output to later stages. The most salient regions in the 
image correspond to the regions of highest activity in the 
VI model output. We now describe the three stages in 
greater detail. 



A. Simple Cell Classical Receptive fields 

Visual input to the model comes from an image, and is 
given by the set of intensity values /(r) for each point r = 
(x, y) in the image. To construct the simple cell classical 
receptive fields, this input is convolved with the gabor 
function ^^(r) before thresholding is applied, giving 



Uir) = H{ g^v - v')I{r')dv' 
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and depends on the parameters (/), 7, a. A, and Lp [l4|. 
The integral in Eq. ([T]) is taken over the whole image, 
and is a unit step function satisfying: H{x) = 1 if 
X > and H{x) = {) \i x < n. The key parameter here 
is (j) — the neural orientation preference — which we have 
made explicit in Eq. ([T]). All other parameters will be 
considered as being constant for a particular image. 



B. Horizontal Connections and Neural Dynamics 

Simple-cell neural populations will be labeled by their 
position r G i?^, and orientation preference (j) G (0,7r). 
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The dynamics of simple-cell neural populations is de- 
scribed by the neural activity ii0(r, t), which corresponds 
to the local mean cell-body potential. Given a visual in- 
put /(/)(r) from the previous stage, neural activity is re- 
distributed according to long range orientation-specific 
horizontal connections (r — r'), which link cell popu- 
lations at r and (/), with cell populations at and (j)' . The 
equation for neural-activity dynamics has a similar form 
to that used in coarse-grained models of VI [fl, l2ll - [23| , 
and is given by 



d \ 

j ^00'(r-r)5W'(r,t))dr' + /4r), (5) 



where r 10 ms is the membrane time constant over 
which incoming action potential spikes are smoothed into 
pulses at the synapses of a neuron, and S is the popula- 
tion spiking rate, given by 



S{u) 



l + exp(-(w-6')/(j') 



(6) 



which is a sigmoid-shaped function, with spiking-rate 
threshold ^, and width a. The sum in Eq. (|5|) is over 
all neural orientation preferences — presumably forming 
a discrete set, and the integral is over all neural popula- 
tion positions. 

Solutions of Eq. (|5|) are guaranteed to converge to a 
stable fixed point if an associated Lyapunov function ex- 
ists [sOj- Alternatively, we can avoid the dynamical sta- 
bility problem by setting the time derivative in Eq. ([5j) 
to zero, and working with an equilibrium solution. We 
also let cr' ^ in Eq. (j6j) so that we can replace S in 
Eq. ([5j) with a unit step function H satisfying: H{x) = 1 
if X > 6>, and H{x) = if x < 6. The final step is to 
assume /^(r) > are the only points where '^^(r) > 6>, 
i.e. neural activity is only ever above threshold where 
the input is above threshold. This means horizontal con- 
nections modulate neural activity rather than drive it 
P^ll4l3|. We can now replace '^^(r) with /^(r) in the ar- 
gument of and Eq. (|5]) reduces to 



v')H{I^,{v'))dv' + I^{v). (7) 



This equation is our convolution model, giving the fa- 
cilitation due to horizontal connections. The horizontal 
connections have the form 



1^00/ (r) M00/i^(i?0/r), 



(8) 



where i?^ is the rotation matrix from Eq. ([3j), M^^/ is 
a matrix for the orientation preference specificity of the 
horizontal connections, and w(y) is a function that de- 
pends on the spatial distribution of the connections. 

Processing time is limited in many visual tasks where 
bottom-up saliency is required. The equilibrium given by 



Eq. ([7]) takes approximately 10 ms to attain, and neural 
activity can then be directly output to later parts of the 
visual pathway. Summing neural activity over all orien- 
tation preferences gives the total output as 



(9) 



and the regions of largest u{y) are the regions which are 
most salient in the image input. Alternatively, if a longer 
processing time is available, neural activity can be it- 
erated through thresholding and local feedback connec- 
tions, as described next. 



C. Thresholding and Local Feedback 

This stage of the model involves thresholding i^0(r) 
from Eq. ([7]) to select only the regions where it attains 
its largest values, returning the result to Eq. ([7]) as /^(r), 
then re-calculating ^/^(r), and so on. The pooling oper- 
ation of VI complex cells has previously been modeled 
by selecting the maximum from each group of simple cell 
outputs [20]. Here, we use inhibitory neural dynamics to 
perform a global thresholding of simple cell outputs. In 
a more detailed model, the pooling operation of VI com- 
plex cells could be used to perform a local thresholding. 
The equilibrium solution for spatially-uniform inhibitory 
dynamics is given by 



j;(r) = H{u^{v) - u), 



(10) 



where is a unit step function with threshold ^, and u 
determines the strength of spatially uniform inhibition. 
The solution given by Eq. (fTQ|) is a simple threshold- 
ing step: defining = then all ^/^(r) < -\- AO is 
discarded in Eq. (p^Oj) . This is consistent with selecting 
regions where ^/^(r) attains its largest values. 

Next, we include the effect of local feedback con- 
nections by replacing /^(r) in Eq. ([7]) with /^(r) from 
Eq. (p!Q|) . then use it to re-calculate ^/^(r). The whole 
process is repeated as many times as possible. Iterating 
Eqs. ([7]) and (p^Oj) will mean any visual input not facil- 
itated by the horizontal connections will eventually be 
discarded due to thresholding. The number of possible 
iterations is determined by the processing time available 
for the specific visual task being carried out. Assuming 
local feedback and thresholding act faster than horizontal 
integration, each iteration will require at least 10 ms — 
meaning that only a few iterations may be possible in 
many low level visual tasks involving saliency. 



III. DETECTING SALIENT CONTOURS IN 
EXAMPLE IMAGES 

The model given by Eqs. and ([7|)- (p!Q|) is now 

used to find salient contours in the example images shown 
in Fig. 1. A salient line made from 4 co-linear line 
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FIG. 1: Example images of salient contours within cluttered 
backgrounds, (a) Salient line made from 4 co- linear line seg- 
ments, (b) Salient circular closed contour, (c) Salient non- 
circular closed contour. 




FIG. 2: Plot of the function w{r) used in Eq. ©. The pa- 
rameters ri and r2 give the minimum and maximum range 
(arrows) of horizontal connections, ip is the angular disper- 
sion of excitation off the orientation preference axis (shown 
as the horizontal axis), while We and Wi are the strengths of 
excitation and inhibition, respectively. 

A. Form of Horizontal Connections 

Horizontal connections are parameterized to link neu- 
ral populations which are neighbors in both r and (j) — 
i.e. position and orientation preference. For any two pop- 
ulations with (j) and the elements of M^^/ in Eq. (j8j) 
are chosen as 

_ j 1, if 1^ (mod tt) - f I < (/) 

max •) /i 1 \ 

iv^cj^' - I otherwise, ^^^^ 

where mod tt denotes that a factor of tt can be added or 
subtracted from (j) in order to make the inequality true. 
The parameter (praax is the size of the ^-neighborhood 
of neural populations which are connected. The form of 
w{v) used in Eq. ([8|) is shown in Fig. 2, and is a modified 
form of the association field proposed in Ref. [18]. The 
spatial distribution of connections is anisotropic: strong 
excitation along the axis of orientation preference (shown 
as = in Fig. 2) means that co-linear line segments are 
strongly facilitated. Crude facilitation of co-circular line 
segments also takes place due to the angular dispersion of 
excitation off the orientation preference axis. The param- 
eters ri and r2 determine the minimum and maximum 
range of connections — ri is a short-range cutoff intended 
to eliminate self-excitation of short line segments, ip spec- 
ifies the angular dispersion of excitation off the orienta- 
tion preference axis, while We > ^ and Wi < {) are the 
strengths of excitation and inhibition, respectively. 



B. Salient Line 



segments is shown in Fig. 1(a) and will be treated in 
Sec. nil B| a salient circular closed contour is shown in 
Fig. 1(b) and will be treated in Sec. IIII C] while a salient 
non-circular closed contour is shown in Fig. 1(c) and will 
be treated in Sec. IIIIDI 



Now we apply our model to group co-linear line seg- 
ments which form a salient line. The image in Fig. 1(a) 
contains a single salient line made of 4 co-linear line seg- 
ments within a cluttered background of 104 line segments 
with random orientations and positions. The image is 
normalized to have side lengths of 1 unit, setting the 
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length scale for parameters describing the receptive field 
size and horizontal connection range. The size of each 
line segment is 0.0417x0.0125, and the value of /(r) is 1 
on a line segment, and elsewhere. 

To detect the salient line in Fig. 1(a) the first step is to 
find /(/)(r) — that is, for each value of (j) we find the set of 
line segments which trigger the largest neural activity re- 
sponse. We discretize (j) into k elements between and tt: 
^ = 0, 7r//c, 27r//c,..., {k — l)7r//c; and set k = 10. Apply- 
ing Eqs. ([I])-(j4]) to the image in Fig. 1(a) yields /^(r), as 
shown in Fig. 3(a) for all values of (j). This corresponds to 
the simple-cell neural activity. Next, applying Eqs. ([7])- 
(|9|) yields the neural activity u{v) from the feed- forward 
model, as shown in Fig. 3(b). The model parameters are 
given in the figure caption, and the values of and ri 
were chosen to facilitate co-linearity and a short-range 
cutoff (where only line segments separated by distance 
d > ri are facilitated). Several groups of co-linear line 
segments above the cutoff range ri have been facilitated 
by horizontal connections in Fig. 3(b), but only one of 
these corresponds to the salient line in Fig. 1(a) — the 
rest are fortuitous alignments of background clutter seg- 
ments. Finally, iterating Eqs. ([7]) and (p!Q]) 4 times yields 
u{v) from the feedback model, as shown in Fig. 3(c). Co- 
operative interactions between facilitated line segments 
take place over the 4 iterations until the only activity 
that remains corresponds to the salient line in Fig. 1(a). 
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C. Circular Closed Contour 

The model is now used to group line segments which 
form a closed contour that is approximately circular. The 
image in Fig. 1(b) contains a salient closed contour made 
from 10 line segments within a cluttered background of 
97 line segments with random orientations and positions. 
Image details are the same as in Sec. IIII B[ 

Applying Eqs. ([I|)-(|4|) to the image in Fig. 1(b) yields 
/^(r), as shown in Fig. 4(a) for all values of (j). Next, ap- 
plying Eqs. ([7j)-(|9j) yields the neural activity u{r) from 
the feed- forward model, as shown in Fig. 4(b). The model 
parameters are given in the figure caption, and the val- 
ues of ip and (j)max were chosen to facilitate fine segments 
which form circular arcs as well as lines. Several groups of 
line segments forming circular arcs and lines have been 
facilitated by horizontal connections in Fig. 4(b), how- 
ever, only one of these corresponds to the closed contour 
in Fig. 1(b). As shown in the previous subsection, the 
feed-forward model filters out much of the background 
clutter from the image, while retaining the salient fea- 
ture. 

The feedback model activity is shown in Fig. 4(c), and 
results from iterating Eqs. ([7|) and (p^Oj) 7 times with the 
simple cell activity from Fig. 4(a) and a different set of 
parameter values. It is seen that the only activity remain- 
ing corresponds to the salient closed contour in Fig. 1(b). 
The model parameters are given in the figure caption. 




FIG. 3: Saliency assigned by different stages of the VI model 
for the image in Fig. 1(a). (a) Simple cell activity /^(r) for 
all from Eqs. ©-(g)) with Fig. 1(a) for /(r). (b) Feed- 
forward activity u{r) from Eqs. dT])-®. Only highest activity 
is shown, (c) Feedback activity u{r) from iterating Eqs. (|7| 
and (|10p 4 times. Parameters are: k = 0.06, 7 = 0.5, a — 
0.008, A = 0.025, = 0, n = 0.125, r2 = 0.175, = 7r/12, 
We 0.5, Wi 0, (l)max 7r/12, 0.6, and u 0.44. 



and the values of r2, V^, and 4)rnax have been increased 
relative to their values in Fig. 4(b). The reason for this 
was to increase the size of the excitatory component of 
the horizontal connections in order to increase facilitation 
between neighboring line segments in the closed contour. 
This can be seen in Fig. 5, where the bottom line seg- 
ment of the closed contour is completely facilitated by 
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FIG. 4: Saliency assigned by different stages of the VI model 
for the image in Fig. 1(b). (a) Simple cell activity using 
Fig. 1(b) for /(r). (b) Feed- forward activity from Eqs. ([Tl)-®. 
Only highest activity is shown. Parameters are: ri = 0.042, 
r2 — 0.117, ip — 7r/6, Wi — —0.3, (l)max — 7r/5, and others as in 
Fig. 3. (c) Feedback activity from iterating Eqs. (0) and (|10|) 7 
times with simple cell activity from Fig. 5(a). Parameters are 
now: r2 0.133, 7r/4, ^max 7r/3, u 0.473 + 0.017n 
(for iterate n) and others as in (b). 



its left and right neighbors. A general principal now ap- 
plies: Each line segment that is part of a closed contour 
is facilitated by two neighbors, while line segments at 
the ends of an open contour are facilitated by only one 
neighbor. This principal allows cooperative interactions 
in the feedback model to group hue segments according to 



FIG. 5: Facilitation of the bottom line segment in Fig. 4(c) by 
its left and right neighbors, (a) and (b) show an enlargement 
of the excitatory component of horizontal connections (Grey) 
due to each neighbor. Parameters are as in Fig. 4(c). 

whether or not they form closed contours. Open contours 
get shorter and shorter with each iteration of feedback as 
line segments are discarded from the ends. A closed con- 
tour will also become inhibited less and less as nearby 
clutter segments are iteratively discarded. In order to 
maintain a balance between excitation and inhibition in 
this model, the strength of inhibition in the thresholding 
step needs to be increased as inhibition due to horizontal 
connections gradually decreases. This is done linearly in 
the number of iterations using the form for u given in 
Fig. 4. 



D. Non-Circular Closed Contour 

The model is now used to group hue segments which 
form a closed contour that is not even approximately cir- 
cular, but has an arbitrary shape. The image in Fig. 1(c) 
consists of several curved segments, some making up the 
closed contour and the rest forming background clutter. 
Applying Eqs. to the image in Fig. 1(c) yields 

/^(r), as shown in Fig. 6(a) for all values of <j). Next, ap- 
plying Eqs. ([7])-([9j) yields the neural activity u{r) from 
the feed- forward model, as shown in Fig. 6(b). The feed- 
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back model activity is shown in Fig. 6(c), and results 
from iterating Eqs. ([7]) and ([TQ|) 9 times. It is seen that 
this activity approximates the salient closed contour in 
Fig. 1(c). 

The model parameters are given in the figure caption, 
and the value of (pmax indicates that horizontal connec- 
tions are not orientation specific — line segments of all ori- 
entations are connected together. The reason is that the 
closed contour is not even close to being smooth or circu- 
lar, and has several regions where line segments actually 
meet at right angles instead of having smooth circular 
joins. Models which are limited to co-circular connec- 
tions would not be able to detect a closed contour in this 
case. 



IV. SUMMARY AND DISCUSSION 

In this work, our primary aim has been to establish a 
solid foundation for modeling horizontal connections in 
biological neural networks. Our model takes into account 
the known classical receptive fields of simple cells in VI, 
the activity dynamics of neural populations, and some 
of the basic properties of horizontal connections found in 
physiological and psychophysical studies. 

We have applied our model to the detection of salient 
contours in three different cases: a line, a circular closed 
contour, and a non-circular closed contour. Using differ- 
ent values of the model parameters the model performed 
well in all cases: assigning higher neural activity to line 
segments forming a salient contour, and lower activity to 
line segments forming background clutter. Values of the 
model parameters for each case indicated different facili- 
tation mechanisms taking place. Horizontal connections 
facilitating co-linear line segments and a short-range cut- 
off were useful for detecting salient lines. Facilitation 
of line segments forming circular arcs was found to be 
useful for detecting salient closed contours which were 
approximately circular. In addition, cooperative interac- 
tions were able to distinguish between open and closed 
contours — open contours getting shorter and shorter with 
each iteration as line segments were discarded from the 
ends, while closed contours remained unchanged. For 
the detection of non-circular closed contours facilitation 
of line segments of all orientations was found to be nec- 
essary. The reason was that there was no guarantee of 
having smooth circular joins between different line seg- 
ments in this contour. 

Finally, we showed how our model can operate as ei- 
ther a feed-forward or a feedback network. Operating 
as a feed-forward network, the model assigns saliency 
within 10 ms, and increases the neural activity of line 
segments facilitated by the horizontal connections. In 
Figs. 3(b) and 4(b), the feed- forward network filtered 
out much of the background clutter while retaining the 
salient contour. Operating as a feedback network, fur- 
ther improvement is achieved through cooperative inter- 
actions over several iterations. Further, our model does 




FIG. 6: (a) Simple cell activity using Fig. 1(c) for /(r). (b) 
Feed-forward activity from Eqs. ([7|)-(|9|). Only highest activity 
is shown, (c) Feedback activity from iterating Eqs. {|7| and 
(ITQI) 9 times. Parameters are: A: = 8, a = 0.017, A = 0.048, 
ri = 0.05, r2 = 0.117, ip = 7r/3, Wi = 0, (l)max — 7r/2, u — 
0.66, and others as in Fig. 3. 



not suffer from stability problems as dynamical neural 
models sometimes do. This makes it suitable for use 
in assigning saliency in biologically-inspired neural net- 
works performing object recognition. 

Several further questions remain. Firstly, it is difficult 
to know what model parameter values are actually used 
in computations involving VI horizontal connections. 
It seems unlikely that precise definitions such as co- 
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circularity are implemented by horizontal connections — 
this property would only apply to a small number of all 
salient contours. We favor a less stringent construction 
involving a modified association field. Also, scale invar i- 
ance has been observed in natural images. To include 
this in our model would require a range of receptive field 
sizes and horizontal connection ranges to be considered. 

The performance of our model in the successful detec- 
tion of a salient feature depends on details of the back- 
ground clutter relative to the feature. For example, op- 
erating as a feedback network, the number of iterations 
taken to distinguish between open and closed contours 
will depend directly on contour length. A long open con- 
tour will take a larger number of iterations to resolve 
than a short one. Similarly, detecting the most salient 
feature from a set of salient features could also take a 



large number of iterations. Addressing these questions, 
and using bottom-up saliency to robustly improve per- 
formance in object recognition tasks, will be the subject 
of future work. 
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